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Abstract 

Background: Ten-Eleven Translocation (TETs)proteins mediate the oxidation of 5-methylcytosine (5mC) to 5- 
hydroxymethylcytosine (5hmC). Tetl is expressed at high levels in mouse embryonic stem cells (ESCs), where it 
mediates the induction of 5hmC decoration on gene-regulatory elements. While the function of Tetl is known, the 
mechanisms of its specificity remain unclear. 

Results: We perform a genome-wide comparative analysis of 5hmC in pluripotent ESCs, as well as in differentiated 
embryonic and adult cells. We find that 5hmC co-localization with Polycomb repressive complex 2 (PRC2) is 
specific to ESCs and is absent in differentiated cells. Tetl in ESCs is distributed on bivalent genes in two 
independent pools: one with Sin3a centered at non-hydroxymethylated transcription start sites and another 
centered downstream from these sites. This latter pool of Tetl co-localizes with 5hmC and PRC2. Through co- 
immunoprecipitation experiments, we show that Tetl forms a complex with PRC2 specifically in ESCs. Genome- 
wide analysis of 5hmC profiles in ESCs following knockdown of the PRC2 subunit Suzl2 shows a reduction of 
5hmC within promoter sequences, specifically at H3K27me3-positive regions of bivalent promoters. 

Conclusions: In ESCs, PRC2 recruits Tetl to chromatin at H3K27me3 positive regions of the genome, with 5hmC 
enriched in a broad peak centered 455 bp after the transcription start site and dependent on the PRC2 
component Suzl2. These results suggest that PRC2-dependent recruitment of Tetl contributes to epigenetic 
plasticity throughout cell differentiation. 
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Background 

In eukaryotic cells, 5-methylcytosine (5mC) occurs 
almost exclusively within a CpG context, and is catalyzed 
by the family of DNA methyltransferase (DNMT) 
enzymes [1-3]. More recently, a number of studies have 
identified a mechanism of DNA demethylation involving 
the oxidation of 5mC to 5-hydroxymethylcytosine 
(5hmC), which can function as a new epigenetic marker 
or as an intermediate toward further oxidative states by 
Ten-eleven translocation (TET) proteins [4-11]. In 
embryonic stem cells (ESCs) and in embryonic and adult 
tissues, 5hmC modification occurs at high levels, whereas 
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it is significantly reduced in cancer[12-17]. In ESCs, 
5hmC is principally catalyzed by Tetl, which has high 
expression in these cells. Genome-wide studies in ESCs 
have shown 5hmC enrichment on regulatory elements, 
such as promoters, enhancers, and gene bodies [18-27]. 
Tetl depletion in ESCs leads to both transcriptional 
activation, in accordance with its role in oxidation of 
5mC, and transcriptional repression [6,20,25,28-32]. 
Genome-wide analysis have shown that Tetl binding cor- 
relates with the transcriptional repressor Sin3a, which 
forms a nuclear complex with Tetl and also with the 
Polycomb repressive complex 2 (PRC2) [19,23,25,32]. 

PRC2 is the enzymatic complex that mediates the tri- 
methylation of lysine 27 of histone H3 (H3K27me3) on 
developmental genes that determine whether the chromatin 
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remains either open or fully inaccessible. It is formed by 
a core complex that includes Enhancer of Zeste 1 or 2 
homolog (Ezhl or Ezh2), Suppressor of Zeste 12 homolog 
(Suzl2), Embryonic ectoderm development (Eed), and 
other accessory subunits, many of which are specific to 
ESCs [33-41]. PRC2 is involved in a number of different 
biological processes, and its dysregulation is associated with 
carcinogenesis [33,42]. 

Using genome-wide analysis of 5hmC distribution in 
ESCs with respect to embryonic and adult tissues, we 
found that the overlap between the repressive modification 
H3K27me3 and 5hmC is ESC-specific, and we studied the 
molecular mechanism of the interplay between Tetl and 
PRC2 in ESC. 

Results 

Genome-wide distribution of 5hmC in ESCs and 
differentiated cells 

We mapped the genome-wide distribution of 5hmC 
using the glucosylation, periodate oxidation, biotinylation 
(GLIB) method followed by sequencing using an Illumina 
platform (GLIB-Seq) [18] in ESCs, primary mouse 
embryonic fibroblasts (MEFs), and two different tissues 
with distinct embryological derivation: brain, which is 
known to have high levels of 5hmC, and liver, which is a 
homogeneous tissue composed almost exclusively of 
hepatocytes. 

Heatmaps plotted using the gene expression levels 
obtained from RNA sequencing (RNA-Seq) analysis 
showed a similar distribution pattern in all samples, 
with 5hmC enrichment typically on gene bodies with 
respect to the transcription start site (TSS) (Figure lA). 
Classification of the genes by their level of expression 
showed an enrichment of 5hmC on upstream promoter 
regions and the gene bodies of genes with higher 
expression. By contrast, unexpressed genes or genes 
expressed at low levels showed an enrichment of 5hmC 
at the TSS, which was more evident in undifferentiated 
ESCs (Figure IB). To further investigate the relationship 
between hydroxymethylation and tissue specificity, we 
analyzed the distribution of 5hmC for genes expressed 
in the liver by separating them in two groups: liver-specific 
genes and housekeeping genes. Notably, liver-specific 
genes showed 5hmC enrichment only on the upstream 
region and along the genes in the liver itself, whereas they 
showed increased 5hmC at the TSSs in ESCs, MEFs, and 
brain tissue (Figure IC, left panels). Conversely, the house- 
keeping genes in all cell types had 5hmC distributed at 
upstream promoter regions and in the gene bodies but not 
at the TSS (Figure IC, right panels). 

For each cell type, we then plotted the expression level 
(reads per kb of exon per million mapped reads (RPKM) 
loglO) and the 5hmC intensity normalized for the gene 
length for each gene to further clarify the role of 5hmC 



in gene expression (see Additional file 1 Figure SI). This 
analysis identified 5hmC enrichment at genes with medium 
expression levels, with a peak around RPKM loglO value = 1 
(Figure IC), whereas genes that were not expressed 
(RPKM loglO value <-l) or expressed at very high levels 
(RPKM loglO value >3) showed little or no 5hmC. 

The examples (see Additional file 1: Figure SI) show 
the 5hmC distribution and transcript levels of the 
PouSfl and Albumin genes, which are highly expressed 
in ESCs and liver, respectively, and Eeflal, which is 
highly expressed in all four cell types. Taken together, 
these results show that 5hmC is enriched on the TSSs 
of genes that are not expressed or are expressed at low 
levels, whereas genes that are more highly expressed 
show an enrichment of 5hmC along the gene but not at 
TSSs, independent of the cell type. 

5hmC correlates with PRC2 and H3K27me3 In ESCs, but 
not In differentiated cells 

Recent studies in ESCs have highlighted 5hmC enrich- 
ment on specific genomic elements and in association 
with some histone modifications [18-20,24-26,29,32]. We 
extended this analysis by comparing ESCs with differen- 
tiated MEFs, and brain and liver tissues. Analysis of 5hmC 
density on promoters and gene bodies normalized accor- 
ding to base pair length showed that 5hmC was signi- 
ficantly enriched on promoters in ESCs but not in 
differentiated cells, (Figure 2A). 

As previously observed, the exons in ESCs showed 
higher levels of DNA hydroxymethylation than did the 
introns, and this difference was confirmed in the other cell 
types we analyzed (Figure 2B). By contrast, the DNA of 
active enhancers was enriched by hydroxymethylation in 
ESCs, whereas this correlation was inverted in differen- 
tiated cells (Figure 2C). We also found significant 5hmC 
enrichment in promoters with low or intermediate CpG 
content (LCP and ICP, respectively) compared with high 
CpG (HCP) content in all cell types analyzed (Figure 2D). 

Importantly, we found strong enrichment of 5hmC on 
H3K4me3K27me3 double-positive and H3K27me3- 
positive promoters compared with H3K4me3-positive 
promoters in ESCs but not in the other cell types (Figure 
2E). Further comparison of our data with published histone 
modifications and transcription factor occupancy confirmed 
that, in ESCs, 5hmC co-localizes with H3K27me3, as well 
as with Ezh2 and Suzl2 binding (Figure 2F; see Additional 
file 1: Figure S2) 

Analysis of the genome-wide distribution of 5hmC 
and H3K27me3 at promoters (± 5 kb) of all genes, 
ordered by expression level, showed that H3K27me3 is 
more abundant on promoters of genes with low expres- 
sion in all cell types. However, in ESCs only, H3K27me3 
promoters showed a significant enrichment of 5hmC 
(Figure 3A,B). 
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Figure 1 5hmC distribution in plurlpotent and differentiated cells. (A) Heatmaps of 5hmC occupancy on promoters and gene bodies in 
embryonic stem cells (ESCs), mouse embryonic fibroblasts (MEFs), and brain and liver tissue. Genes are ordered by the mRNA levels of each cell 
type obtained by RNA sequencing (RNA-Seq) analysis. (B) The 5hmC distribution profile in promoter regions and gene bodies of genes grouped 
into three equal sets by their mRNA expression level. (C) Heatmaps and distribution profiles of 5hmC in promoters and gene bodies of genes 
separated into two groups according to their expression level, as expressed only in liver (liver-specific) or expressed in in ESCs, MEFs, brain, and 
iver (housekeeping genes). (D) Scatter plot of 5hmC gene body density and expression level for each gene in ESCs, MEFs, brain, and liver. 
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Figure 2 ShmC distribution on genomic elements and histone modifications The percentage of 5hmC was normalized to tlie lengtli (bp) 
of eacli genomic segment considered. (A) ShmC was enriched on promoters (± 1 l<b) in embryonic stem cells (ESCs) (*P < 0.01 ).(B) ShmC was 
enriched on exons with respect to introns (*P < 0.01 ).(C) ShmC was significantly enriched in active enhancers in ESCs and in poised enhancers 
in differentiated tissues (*P < 0.01; **P < 0.05).(D) Promoters with low and intermediate presence of CpG islands (LCP and ICP, respectively) were 
preferentially marked by DNA hydroxymethylation with respect to promoters with high CpG (HCP) {*P < 0.01). (E) In ESCs, promoters with both 
H3K4me3 and H3K27me3 (K4/K27) and promoters with H3K27me3 but not H3K4me3 (K27) were significantly enriched in DNA 
hydroxymethylation compared with promoters with H3K4me3 but not H3K27me3 (K4) (*P < 0.01). (F) Analysis of correlations between ShmC 
and histone modifications in all four cell types (*P < 0.01). 



Tet1 bimodal binding profile in H3K27me3 positive 
promoters 

The above data show a correlation between 5hmC and 
H3K27me3 distribution on ESC promoters. Further ana- 
lysis of Tetl distribution, using published chromatin 
immunoprecipitation sequencing (ChlP-Seq) datasets 
[25] [32], on genes ordered by level of H3K27me3, 
showed that Tetl binding correlates with Ezh2 and 
Suzl2, as well as with Sin3a, a protein recently discovered 
in complex with Tetl [25] (Figure 4A). Approximately 
84% of Suzl2 and 47% of Sin3a binding sites were also 
bound by Tetl, whereas the overlap between Sin3a and 
Suzl2 was low (see Additional file 1, Figure S3). 

Moreover, Tetl-Sin3a co-bound peaks showed high 
CpG island content, whereas the number of CpG islands 
was significantly decreased in Tetl-H3K27me3 co-bound 
regions (see Additional file 1, Figure S3). 

Using unsupervised analysis of Tetl binding, with a 
k-means clustering algorithm based on the Pearson cor- 
relation coefficient across profiles, we generated binding 
profile curves across each promoter (± 5 kb). After 
separating the results into two clusters (k = 2), we 
obtained two different Tetl binding profiles: cluster 1, 



with a profile centered on the TSS and enriched at K4- 
only genes; and cluster 2, with a larger profile centered 
downstream from the TSS enriched on bivalent promoters 
(Figure 4B). Next, we plotted the Tetl peaks around the 
TSS. On H3K27me3-negative genes, the Tetl binding 
overlapped with Sin3a (Figure 4C, upper panel). Interest- 
ingly, on H3K27me3-positive genes, Tetl displayed a 
bimodal profile that correlated with Sin3a on the TSS and 
with Suzl2 downstream of the TSS (Figure 4C, lower 
panel). Fitting the Tetl binding curve on the TSS (± 5 kb) 
using a superposition of two Gaussian distributions 
(six parameters fit, see Methods) showed that the first 
Gaussian curve was narrower and centered 39 bp 
upstream of the TSS, whereas the second was broader and 
centered 455 bp downstream of the TSS (Figure 4D). 
Taken together, these data suggest that Tetl is present on 
bivalent genes in two distinct pools, one overlapping with 
Sin3a and the other with Suzl2. 

PRC2 interacts with Tet1 in ESC, and its binding is 
required for DNA hydroxymethylation at bivalent genes 

The data above suggest interplay between Tetl and 
PRC2 in ESCs. Next, we examined whether Tetl could 
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Figure 3 ShmC was enriched at H3K27me3 loci in embryonic stem cells (ESCs) but not in differentiated cells. (A) Heatmaps of 5hmC 
and H3K27me3 of genomic regions around tlie TSS (± 5 kb) in ESCs, mouse embryonic fibroblasts (IVlEFs), brain, and liver, rank-ordered by 
mRNA expression level of each cell type. (B) Profile distribution of ShmC and H3K27me3 at the TSS (± 5kb) in H3K27me3-positive genes. 



form a complex with PRC2 by co-immunoprecipitation 
of the endogenous proteins from ESCs and MEFs. Tetl 
was co-immunoprecipitated with the PRC2 subunits 
Suzl2 and Ezh2 in ESCs (Figure 5A). The reciprocal 
immunoprecipitation experiment with a Suzl2 antibody 
showed association of PRC2 with Tetl (Figure 5B). 
Notably, we did not observe PRC2 co-immunoprecipitation 
with Tetl in MEFs (Figure 5A,B)> whereas SinSa was 
co-immunoprecipitated with Tetl in both cell lines, 
thus confirming that the physical connection between 
Tetl and PRC2 complex is limited to ESCs. To test 
whether the Tetl/PRC2 interaction is necessary for DNA 



hydroxymethylation at H3K27me3 regions, we silenced 
Suzl2 in ESCs using two different small hairpin (sh)RNAs. 
Both constructs decreased Suzl2 protein and H3K27me3 
levels without affecting the levels of Tetl or Sin3a (Figure 
5C; see Additional file 1: Figure S3). Suzl2 silencing 
resulted in a significant reduction of global DNA hydroxy- 
methylation (Figure 5D; see Additional file 1: Figure S3). 

Genome-wide analysis of 5hmC profiles in control and 
Suzl2 knockdown ESCs showed a reduction of ShmC at 
the promoter regions of H3K27me3-positive genes within 
the Tetl co-bound regions but not at H3K27me3-negative 
regions (Figure 6A,B; see Additional file 1, Figure S4). 
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Figure 4 Bimodal distribution of Tetl on bivalent genes. (A) Heatmaps of H3K4me3, H3K27me3, 5hmC, Ezh2, Suz12, Tetl-C [25], Tetl-N [25], 
Tetl-Wu [32], Sin3a #1 (Abeam), and Sin3a #2 (Santa Cruz Biotechnology) on the transcription start site (TSS) of genes ranl<-ordered by their 
H3K27me3 level. (B) Distribution frequency of Tetl binding obtained by clustering of Tetl binding data by l<-means algorithm revealed two 
different Tetl occupancy profiles on gene promoters.(C) Tetl, Sin3a and Suz12 occupancy around the TSS (± 2 kb) of H3K27me3-negative 
(upper panel) or H3K27me3-positive (lower panel) genes. (D) Fitting of the Tetl binding in H3K27me3-positive genes revealed two Gaussian 
profiles, one centered -39 bp upstream from the TSS, and the other centered +455 bp downstream from the TSS. 
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Figure 5 PRC2 interacts with Tetl and is required for correct DNA hydroxymethylation. (A) Nuclear extracts of embryonic stem cells 
(ESCs) or mouse embryonic fibroblasts (MEFs) were immunoprecipitated with anti-IgG or anti-Tetl antibodies. Western blotting analysis was 
performed using the antibodies indicated. For each input, 1% of nuclear extract was loaded. (B) Nuclear extracts of ESCs or IVlEFs were 
immunoprecipitated with anti-IgG or anti-Suz12 antibodies. Western blotting analysis was performed using the antibodies indicated. For each 
input, 2% of nuclear extract was loaded. (C) Western blotting analysis of extracts from the control (small hairpin green fluorescent protein; 
shGFP) or Suz12 knockdown (shSuz12) ESCs was performed using the antibodies indicated. (D) Dot-blot analysis and signal quantification of 
ShmC and 5mC in DNA extracted from control or Suzl2 knockdown ESCs. Single-stranded (ss)DNA was used as a loading control. The 
experiments were performed in triplicate. 
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Chip analysis of Tetl binding to the bivalent genes 
Kdr and Hoxal and the non-bivalent genes Cdc25a and 
PouSfl in wild-type and Suzl2-silenced cells showed a 
reduction of Tetl binding only at Tetl-PRC2 co-bound 
regions (Figure 6C; see Additional file 1: Figure S4). 
Taken together, these results indicate that PRC2 is 
required for recruitment of Tetl and DNA hydroxy- 
methylation on bivalent promoters. 

Discussion 

We compared the genome-wide distribution of 5hmC in 
ESCs, MEF, and terminally differentiated brain and liver 
tissues by GLIB-Seq analysis, which identified a similar 
enrichment of 5hmC on promoters, enhancers, and 
gene bodies, with a preference for exons in all cell types. 
A major difference between ESCs and other cell types is 
the correlation between 5hmC and H3K27me3, which is 
unique to ESCs, whereas this correlation is not present in 
differentiated fibroblasts or in adult tissues. We found 
that this difference is due to an ESC-specific functional 
interplay between Tetl and the PRC2 complex. 

We found that in ESCs, but not in differentiated cells, 
PRC2 recruits Tetl to the chromatin of bivalent genes 
to maintain their hypomethylated state. First, through 
in-depth analysis of previous ChlP-Seq data in ESCs, we 
found that Tetl binds to chromatin with two different 
binding profiles: one, which is characterized by narrow 
peaks, is centered on the TSS of almost all genes and over- 
laps with SinSa, while the other, which is characterized by 
broader peaks, is centered downstream of the promoters 
of bivalent genes and overlaps with PRC2. Second, co-im- 
munoprecipitation showed that Tetl interacts with PRC2 
in ESCs but not in fibroblasts. Third, in ESCs, Suzl2 
silencing affected Tetl binding and 5hmC modification 
at bivalent promoters specifically at PRC2-positive 
regions but not at other regions. 

Thus, our results show that Tetl, besides binding at 
the TSS together with SinSa, is recruited by PRC2 
downstream from the TSS at bivalent genes. The differ- 
ence in the peak shape of the two binding profiles is com- 
patible with the different recruitment mechanisms. In one 
case, Tetl binds directly to the DNA on the TSS via its 
CXXC domain, which has high affinity for clustered 
unmethylated CpG dinucleotides [11], whereas in the 
other case, Tetl is recruited by PRC2 to the chromatin. In 
this latter case, the binding of Tetl and the distribution of 
5hmC occupy larger regions, owing to the typical spread 
of PRC2 binding on bivalent promoters. Interestingly, 
Tetl, together with SinSa, localizes to unmethylated TSSs, 
suggesting that, at these regions, this complex binds to 
unmethylated CpG or catalyzes the oxidation of 5mC to 
completion. Conversely, Tetl binding with PRC2 overlaps 
with 5hmC on bivalent genes. Thus, our results are com- 
patible with a model by which Tetl-dependent oxidation 



of 5mC is finely regulated to either eliminate DNA methy- 
lation or generate 5hmC epigenetic marks in different 
regions. Previous findings that MeCP2, Np95, and Mbd3 
recognize 5hmC [10,43-45] imply that this DNA modifi- 
cation can be a specific epigenetic signal. 

We found that PRC2 depletion from ESCs reduced Tetl 
binding on bivalent promoters. This result differs from 
previous studies reporting that Ezh2 knockdown did not 
affect Tetl binding [32]. This discrepancy could be 
because depletion of Ezh2 does not impair PRC2 binding 
to the chromatin or its methylation activity, because in 
ESCs, Ezhl can complement Ezh2 function [39]. By con- 
trast, knockdown of Suzl2 destabilizes the complex [37]. 

We found that Tetl/PRC2 co-immunoprecipitation is 
ESC-specific, as we could not co-immunoprecipitate Tetl 
with PRC2 in fibroblasts, and previous experiments failed 
to detect PRC2 proteins interacting with Tetl in HEK293 
cells [25]. These results suggest that the functional interac- 
tion between Tetl and PRC2 in ESCs is either indirect, 
possibly mediated by one of the ESC-specific cofactors, or 
is dependent on post-translation modifications. 

We also found that the distribution of 5hmC on gene 
bodies is not tissue-specific, but increases with the level 
of gene expression in all cell types analyzed, suggesting 
that 5hmC acts as a positive activator by reducing the 
5mC level. This regulation is confirmed by the recent 
finding that 5hmC was increased in gene bodies that 
were transcriptionally upregulated in a model of neuro- 
nal differentiation [46]. However, we found that genes 
expressed at very high levels showed little or no 5hmC, 
suggesting that DNA hydroxymethylation is not neutral 
but is mildly inhibitory on the transcription process. 
These results agree with in vitro experiments showing 
that 5hmC modifications are mildly repressive when 
present in the gene body [47]. 

We reported that PRC2 depletion reduced Tetl bind- 
ing and the presence of 5hmC at promoters of bivalent 
genes in ESCs. A previous report showed that Tetl 
depletion results in the increased methylation and 
minor binding of PRC2 [32], most likely because Tetl- 
dependent demethylation facilitates PRC2 recruitment 
to the DNA, as methyl CpG counteracts the binding of 
PRC2. In fact, genome-wide analysis reported mutual 
exclusiveness of H3K27me3 with DNA methylation in 
CpG islands [48-51], and DNA methylation inhibits the 
binding of PRC2 in vitro [50]. Taken together, these 
results suggest a positive feedback loop between Tetl- 
dependent DNA demethylation and PRC2-dependent 
repression through H3K27me3 in ESCs in order to 
maintain developmental genes in the poised status. 
Thus, the presence of these two contrasting epigenetic 
markers on the same chromatin region is required to 
allow activation or repression of bivalent genes following 
action of developmental stimuli. 
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Figure 6 Polycomb repressive complex 2 (PRC2) is required for ShmC deposition and Tetl binding in Tet1-PRC2 co-bound regions. (A) 

Heatmaps and distribution profiles of 5iimC around the transcriptiorn start site (TSS) of genes rank-ordered by H3K27me3 levels in control or 
Suz12 knockdown embryonic stem cells (ESCs) See also Figure S3(C-E). (B) Sin3a, Tetl, H3K27me3, Suz12 occupancy and ShmC occupancy in 
control and Suzl2 knockdown cells on representative examples of bivalent (Kdr and Hoxal) and K4-only (Cdc2Sa and PouSfl) genes. Chromatin 
immunoprecipitation (ChIP) analysis of Tetl binding of genomic regions (a to n) as indicated in panel (A), in control or Suzl2 knockdown ESCs 
(*P < 0.01). 
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Conclusions 

Using genome-wide analysis, Suzl2 silencing, and 
immunoprecipitation experiments we found that in 
ESCs, but not in other cell types, PRC2 recruits Tetl to 
H3K27me3 regions. Our results identify a novel way by 
which Tetl is recruited onto the chromatin, indepen- 
dent of its ability to bind directly to CpG or to be 
recruited by Sin3a, and clarify the mechanism that links 
H3K27me3 modifications on the nucleosome and 5hmC 
on the DNA in ESCs. These findings highlight the 
mechanistic link between PRC2 and Tetl and contribute 
to our understanding of how these two epigenetic modi- 
fiers regulate the chromatin state of bivalent genes to 
maintain these bivalent genes in a poised state, ensuring 
their dynamic regulation during differentiation. 

Materials and methods 

Cell culture 

E14 mouse ES cells were cultured in high-glucose 
DMEM (Invitrogen Corp., Carlsbad, CA, USA) supple- 
mented with 15% PBS (Millipore Corp., Billerica, MA, 
USA), 0.1 mmol/1 nonessential amino acids (Invitrogen), 
1 mmol/1 sodium pyruvate (Invitrogen), 0.1 mmol/1 2- 
mercaptoethanol, 1500 U/ml Leukemia Inhibitory Factor 
(LIP; Millipore), 25 U/ml penicillin, and 25 |ig/ml strep- 
tomycin. MEPs were derived from 13.5 day pregnant 
female mice and cultured in high-glucose DMEM sup- 
plemented with 10% PCS. Mouse tissues were extracted 
from 8-week-old mice. Por silencing of Suzl2, ESCs 
were transfected as previously described [52] with the 
TRCN0000123889 and TRCN0000123891 vectors (Open 
Biosystems/Thermo Scientific Inc., Ptitsburgh PA, USA) 
and then maintained for selection for 3 days with 1 |Jg/ml 
puromycin. 

DNA extraction and GLIB-Seq 

Genomic DNA was extracted using a DNeasy Blood and 
Tissue kit (Qiagen Inc., Valencia, CA, USA). 

GLIB was precipitated using a Hydroxymethyl Collector 
(Active Motif, Carlsbad, CA, USA). Libraries were gener- 
ated with a ChlP-Seq Sample Prep Kit (Illumina Inc., 
San Diego, CA, USA) and sequenced on an HiScanSQ 
Platform (lUumina). 

Bioinformatic analysis 

Sequencing data were mapped to the mouse genome 
(mm9 assembly) using Bowtie (version 0.12.7), reporting 
only unique hits with up to two mismatches. Redundan- 
cies were collapsed, and peak calling was performed 
using MACS (version 1.4.1). For comparative analysis, 
we downloaded GEO Datasets data for the ESC histone 
modifications GSE12241 [53,54] and GSE11172 [55], 
and the transcription factors GSE11431 [56], GSE24843 
[25], and GSE26833 [19,32]. For histone modifications 



and mRNA expression, ChlP-Seq and RNA-Seq data on 
ESCs, MEPs, and brain or liver tissues were downloaded 
from the ENCODE project database [57], corresponding 
to the following GEO Datasets: GSE36025, GSE36026, 
and GSE31039. Datasets for ChlP-Seq of H3K27me3 on 
MEPs was downloaded from GSE12241. 

LCP-ICP-HCP promoters were defined as described by 
Weber et al. [58], H3K4me3-only and bivalent promoters 
were defined as described by Ku et al. [59], and active or 
poised enhancers were defined as previously described 
[60,61]. To obtain three groups of equal size, genes with 
RPKM less than 0.1, 0.1 to 5, and less 5 were categorized 
as having no, low, and medium/high expression, respec- 
tively. Liver-specific genes were defined as having RPKM 
less than 10 in liver and less than 1 in other cells. House- 
keeping genes were defined as RPKM greater than 10 in 
all cell types. 

Data mapped on the mouse mm8 assembly was trans- 
posed to the mm9 assembly using the liftOver tool. 
Heatmaps and comparative analyses were performed 
using custom Perl scripts. Clustering of Tetl binding 
profiles was performed using the MATLAB implementa- 
tion of the k-means algorithm using the Pearson correla- 
tion coefficient as the 'distance' between profiles. Using 
100 realizations of the clustering (using different random 
seeds), we foudn that the composition of clusters was 
remarkably stable. The fitting procedure of the average 
Tetl binding profile was performed using the gnuplot 
least squares fitting algorithm. The test function was a 
superposition of two (non-normalized) Gaussian distribu- 
tions (six parameters fit: two means, two variances, one 
relative weight, one overall amplitude factor). 

Nuclear protein extractions 

Cells were harvested in Ix PBS and resuspended in iso- 
tonic buffer (20 mmol/1 HEPES pH 7.5, 100 mmol/1 
NaCl, 250 mmol/1 sucrose, 5 mmol/1 MgCl2, 5 pmol/l 
ZnCl2), then cells were resuspended in isotonic buffer 
supplemented with 1% NP-40 to isolate nuclei. The 
isolated nuclei were resuspended in digestion buffer 
(50 mmol/1 Tris-HCl pH 8.0, 100 mmol/1 NaCl, 250 
mmol/1 Sucrose, 0.5 mmol/1 MgC12, 5 mmol/1 CaCl2, 
5 [imol/1 ZnCl2), and treated with microccocal nuclease 
at 30''C for 10 min. 

Immunoprecipitations 

Nuclear proteins were incubated with 3 i^g of specific 
antibody overnight at 4°C. Immunocomplexes were 
incubated with Protein G-conjugated magnetic beads 
(Dynal; Invitrogen) for 2 h at 4°C. Samples were washed 
four times with digestion buffer supplemented with 0.1% 
NP-40 at room temperature. Proteins were eluted by 
incubating with 0.4 M NaCl TE buffer for 30 min, and 
analyzed by western blotting as previously described [62] . 
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Chromatin immunoprecipitation and dot-blot analysis 

Each Chip experiment was performed at least three inde- 
pendent times, as previously described [52]. Oligonucleo- 
tide sequences are shown in the supplementary data. See 
Additional file 1. Genomic DNA for the dot-blot analysis 
was sonicated for 15 cycles and denatured with 0.4 M 
NaOH, then incubated for 10 minutes at 95°C before 
being spotted onto Hybond™-N* membranes (GE Health- 
care, Little Chalfont, Buckinghamshire, UK). 

Antibodies 

The antibodies used were anti-Tetl, anti-H3K27me3, 
anti-H3K4me3 (all Millipore), anti-ssDNA (Abeam, 
Cambridge, MA, USA), anti-Suzl2and anti-Ezh2 (both- 
Cell Signaling Technology, Danvers, MA, USA), anti- 
actin (Sigma-Aldrich, St Louis, MO, USA), anti-Sin3a 
(SantaCruz Biotechnology Inc., Santa Cruz, CA, USA), 
and anti-5mC and anti-5hmC (both Active Motif). 

Data access 

The Gene Expression Omnibus accession number for 
5hmC profiling of ESC, MEF, brain and liver reported in 
this paper is GSE44566. 

Additional material 



Additional file 1: PDF document containing four supplemental 
figures (Figures S1 to S4), and one supplemental table (Table S1). 
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